OPTIMAL JOINT MEASUREMENTS OF POSITION AND 

MOMENTUM 



D M APPLEBY 

Department of Physics, Queen Mary and Westfield College, Mile End Rd, London 

El 4NS, UK 

(e-mail: D.M.Appleby@qmw.ac.uk) 



Abstract 

The distribution of measured values for maximally accurate, unbi- 
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vestigated. It is shown, that if the measurement is retrodictively 
optimal, then the distribution of results is given by the initial state 
Husimi function (or Q- representation). If the measurement is predic- 
tively optimal, then the distribution of results is related to the final 
state anti-Husimi function (or P-representation) . The significance of 
this universal property for the interpretation of the Husimi function 
is discussed. 
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1. Introduction 

There is currently some interest in simultaneous measurements of position and 
momentum [Q, ||, ||, ||, §|. Measurements of this kind have an immediate, 
technical relevance to the field of quantum optics. They also have a rather more 
general, conceptual relevance to the problem of understanding the classical limit. 

In two previous papers (!(], |ll| we discussed the accuracy of such measurements. 
We began with Braginsky and Khalili's analysis |L2| of single measurements of 
x only, and extended it to the case of simultaneous measurements of x and p 
together. We identified two types of error: the retrodictive (or determinative) 
errors A e ;x, A e ;p; and the predictive (or preparative) errors A e fX, A e fp. We showed, 
that subject to some rather unrestrictive assumptions regarding the nature of the 
measurement process, they satisfy the retrodictive error relationship 

ft 

A oi x A cl p > - 
and the predictive error relationship 

ft 

A ef x A ei p > - 

In the following we address the question: what (if anything) can be said about the 
distribution of measured values in those cases where the lower bound set by one of 
these inequalities is actually achieved? 

We begin, in Section |^, by considering measurements which are retrodictively 
optimal. We define a retrodictively optimal measurement to be any measurement 
belonging to the class of processes defined in ref. which minimises the product 
of retrodictive errors (so that A e jX A e jp = f ), and which is retrodictively unbiased 
[so that the systematic errors of retrodiction are zero — see Eq. ([!]) below]. We show, 
that for such measurements, the distribution of measured values is always given by 
the initial system state Husimi function |l3[ Q. This result is the extension, to the 
general class of measurement processes defined in ref. of the result proved by 
Ali and Prugovecki @, |l5| for the case of measurement processes which are Galilean 
covariant, and (using rather different methods) in ref. ]l6| for the particular case 
of the Arthurs-Kelly process. 

A number of related results have been obtained by other authors. In the case 
of the Arthurs-Kelly process several authors p], || have shown, that the Husimi 
function describes the distribution of measured values for certain choices of initial 
apparatus state. Leonhardt and Paul || have shown that the same is true for 
a number of other processes. However, these authors all confine themselves to 
particular examples of simultaneous measurement processes. They do not consider 
measurement processes in general. Moreover, they do not relate the distribution of 
measured values to the accuracy of the measurement process. In particular, they do 
not show that the Husimi function describes the distribution of results in precisely 
those cases where the measurement is retrodictively "optimal" or "best" . 

Wodkiewicz has proposed an operational approach to the problem of phase space 
measurement || |?J . If one takes the filter reference state (or "quantum ruler" ) used 
to define his operational distribution to be a squeezed vacuum state, and a minimum 
uncertainty state for x and p, then one obtains the Husimi function. It could be 
said that the Husimi function is the operational distribution corresponding to the 
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case when the quantum ruler is most exactly and finely calibrated — a fact which 
obviously ties in with the result which we prove in Section || below. 

However, the result which is most similar to ours is the one obtained by Ali and 
Prugovecki |)| |ll| , working within the framework of the approach based on POVM's 
(positive operator valued measures) and unsharp observables. In fact, their result 
is the same as ours, except that we prove it under much less restrictive conditions 
(unlike Ali and Prugovecki we do not assume Galilean covariance. Galilean covari- 
ance is a consequence of the result which we prove, not a presupposition). It may 
also be worth remarking that our way of analysing the concept of a simultaneous 
measurement process is rather different from theirs. In particular, the objections 
recently raised by Uffink |L7j do not apply to our arguments. 

In Section^ we go on to consider predictively optimal measurements — i.e. mea- 
surements of the type defined in ref. |Tl] ] which minimise the product of predictive 
errors (so that A e fcc A e tp = ■§). We show, that in the case of such a measurement, 
the distribution of results is related to the final state anti-Husimi function | 
(the P-function of quantum optics). This result also represents an extension, to 
the general class of measurement processes defined in ref. p"T[ , of a result proved in 
ref. for the special case of the Arthurs-Kelly process. 

In Section ^ we conclude by discussing the bearing of our results on the inter- 
pretation of the Husimi function. In Section ^| we show that the Husimi function 
describes the outcome of any retrodictively optimal process. In other words, the 
Husimi function has a universal significance. We will argue that this lends some 
support to the idea, that the Husimi function is the quantum mechanical entity 
which most nearly resembles the classical concept, of the "real" or "objective" 
distribution describing an ensemble of identically prepared systems. 



2. Retrodictively Optimal Measurements 

We will say that a simultaneous measurement process of the kind defined in 
ref. fllTf is retrodictively optimal if 

1. The process is retrodictively unbiased, so that 

(ip ® 0ap| £Xi \lp O </>ap) = (V> ® ^ap| £Pi \lp ® 0ap) = (1) 

for all \ip) G JC sy . 

2. The product of retrodictive errors achieves its lower bound, so that 

A ci x A ciP = ~ (2) 

Here and in the sequel we employ the notation and terminology of ref. JTl| . Thus, 
\ip) G JC sy and |0 ap ) G 3~C ap are the initial states of the system and apparatus re- 
spectively, exi, epi are the retrodictive error operators. A e ia;, A e ip are the maximal 
rms errors of retrodiction. 

In ref we considered the special case of the Arthurs-Kelly process. In that 
case one has the commutation relation 

[exi,epi] = -«fr (3) 

This relationship, and the condition of Eq. (|2|), together imply Eq. ([!]). In the gen- 
eral case, however, it is necessary to impose the requirement, that the measurement 
be retrodictively unbiased, as a separate condition. 



In the general case the case the commutation relationship of Eq. (|^) cannot 
be assumed. However, it was shown in ref. that Eq. ([j]) implies the weaker 
statement 

(-0 ® 0a P | [exi, epi] \t/> O ap ) = -i^ (4) 

for every normalised \ip) G Jf sy [but fixed |</> a p)]- It turns out that this is enough 
to prove, that the distribution of measured values is given by the initial system 
state Husimi function, for any retrodictively optimal process. However, the fact 
that we can no longer assume the commutation relationship of Eq. (|3|) , means that 
the proof of this statement is less straightforward than the proof given in ref. jl6| , 
for the special case of the Arthurs-Kelly process. 
In view of Eqs. (0) and (|j) we have 

h 2 

(V> ® <^ap| £ X i k ® 0ap) (ip ® 0ap| £pi (V 1 ® 0ap) = -J (5) 

for every normalised \tp) G J{ sy . We deduce: 

Lemma 1. Given any retrodictively optimal measurement process with initial ap- 
paratus state |</>ap); there exists a fixed number A; such that 

X 2 

(ip <8> ^ap| e X i |V> ® 0ap) = y 



2A? 



/or every normalised G JC, 



sy • 



Remark. We will refer to Aj as the retrodictive spatial resolution of the measure- 
ment. 

Proof. For each normalised G 5£ sy define the number A^, by 

i 

A V , = (2 (ip <g> 0ap| £» |^ ® 0ap)) 2 

In view of Eq. (|^) we then have 

</>ap| 4i |V"8)0ap)) 5 = ''' 



\f2X^ 

We have from the definitions (IT] of A C iX, A C ip 

( I 12 1 \\^ ^Upi^/A^g (A-^) 

A oi a; = sup I Oip <g> ap e X i V> ® 0a P ) = 7= 



|-0>G§ 

and 



1 ^- 

A ci p = SUp ( (V ® 0ap| E Pi 1^ <S> 0ap) ) ' = 7= . . 7— — 

|V)eS v 7 v2mt|^) e s (A^,j 

where § denotes the unit sphere in the system state space. In view of Eq. (Q) it 
then follows 

inf (A^) = sup (A^) 
IVves |V)es 

which means that must be constant. □ 
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We next define the operators 

c a. = ^ir exi ~— epi ) 

(6) 



c * = 71 U £Xi + T epi 

In the general case we cannot assume the commutation relation of Eq. (^|). It 
follows, that c A . , c\. are not, in general, ladder operators. We do, however, have 
the relationship of Eq. (|4|), and this is enough to prove 

Lemma 2. Given any retrodictively optimal measurement process with intial appa- 
ratus state |0 a p) and retrodictive spatial resolution A;, let c x . be the operator defined 
by Eq. Then 

C Aj |0 ap ) = 

/or every \tp) G Jf sy . 

Proof. Given any normalised system state |0), let a, /? € K be the real and imagi- 
nary parts of (0 <8> ap | exi epi |0 ® 0a P ): 

("0 ® ap I exi epi 10 ® ap ) = " + i/3 (7) 

We have 

i fi 
(a 2 + [3 2 ) 2 = ® </> ap | exi epi |0 ® ap ) < pxi |0 ® 0a P )|| pPi 10 ® 0a P ) I = 7: 





0<8 


0ap)) 


14 


<8 


0ap)) 



where 

pXi 10 <S> 0ap)|| = ((0 ® 0a 
pPi|0® 0ap)|| = ((0®0a 

are the norms of the vectors exi |0 <B> </>a P ), epi |0 ® </>ap)- 
In view of Eq. (^) we also have 

-ih = (0 <g) ap | [exi, epi] |0 <S> ap ) = 2«/3 

Consequently, a = and /3 = — §. We then have 



fi 



2 ■ 

fi 
2 



|(0<8>^ap| e X i£Pi |V"8)0ap)| = = 10® 0ap)|| ||epi 10 ® 0ap) 



Now it is generally true, in any Hilbert space, that two vectors ^2) having 

the property 

|<*l| *2>| = |||*1>|| |||*2>|| 

must be parallel. Hence 

e Pi |0 ® <^ ap ) = 7 exi |0 ® 0ap) 
for some 7 e C. Inserting this result into Eq. (]?]) we find 



7- X2 



A? 



The claim follows. □ 
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Now let 

p (pxf, ppf) = J dxf dyn . . . dy in \ (xf, pxi, pn, yn,---,yin\ ip ® 0a P ) | 2 (8) 

be the probability distribution for the final pointer positions. In this expression 
\xt, pxf, ppf, yn, ■ ■ ■ , Vin) is the simultaneous eigenvector of the Heisenberg picture 
operators if, (Xxi, ppf, ytj, with eigenvalues X{, pxi, ppi, yij- We continue to 
employ the notation and terminology of ref. pj] |. Thus, if is the final system 
position operator, pxt and /tpf are the final pointer position operators, and the ytj 
represent the additional, internal degrees of freedom characterising the apparatus. 
Let |(a;,p) A . ) G !K sy be the state with wave function 

(x'\ {x,p) Xi )= exp^-^ix'-xf + jrpx'-^px] (9) 



and let 



h l (x,p) = U{(x,p)M\ 2 (10) 



be the initial system state Husimi function |13, 14l. We want to show 



p(^ X f,Mpf) = Q\ (pxi,PPi) 

for almost all pxf , pp{ whenever the measurement is retrodictively optimal at spatial 
resolution A; ( "almost all" being defined relative to ordinary Lebesgue measure on 
the plane). Our strategy will be to begin by showing that the two functions have 
the same moments: 

dpxfdppf PxfPpf P 0*xf> MPf) = J dpxfdppf MxfMpf Q\ (/*xf, MPf) 

for every pair of non-negative integers n, m. Unfortunately we then face the diffi- 
culty, that although p and QAi are always defined, whatever the initial state of the 
system, the same is not true of their moments. This is because Xi, p ll pxi, PPi are 
unbounded operators. The way in which we will circumvent the difficulty is, first 
to prove the result on the assumption that is in an appropriately chosen dense 
subspace of 3i sy , and then to use a continuity argument to extend it to the case of 
arbitrary \ip). 

be the ladder operators 



1 / 1 . , A; . 
t 1/1. A;/ 



'A,' 



and define number states \n) x , £ !H sy in the usual way, by the requirements 

«aJ0)a,=0 a,(0|0) Ai = 1 \n) Xi = ^=a[? |0) Aj 

v n\ 

(with a slight abuse of notation we sometimes regard the operators i; and pi as 
acting on ^K sy , and sometimes as acting on "K sy <g> JC ap ). We then define &\ l to be 
the dense subspace of !H sy consisting of all finite linear combinations of the vectors 

l»>V 
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It is easily seen that is in the domain of definition of every polynomial 
f(xi,pi). In particular, the integral 

dxdpx n p m Q\ i (x,p) 

is defined and finite for all n, to whenever Q\ { is the Husimi function corresponding 
to a state in 3 r \ i . 

Now define the operators 

ft W i . ^ r 

These operators commute, and so they are certainly not ladder operators. We have 

K = 4, + c Ai 



&L = 4, + ^ (is) 



where c A . and d A . are the operators defined in Eqs. @ and jlj ) respectively. Let 
|^) be any vector e &\ v Then ® ap ) is in the domain of a\.. It is also in 
the domain of c A . (the definition of a retrodictively optimal process tacitly assumes 
that \ip (8 4>np} is in the domain of exi, epi, and therefore in the domain of c A , for 
all IV'))- It is consequently in the domain of b\. . Moreover, in view of Lemma 

&A, |V"H><M = (4, ® I'M 

where d A . |^) also S 3*Aj- Iterating the argument we conclude that \ip ® ap ) is in 
the domain of & A ™ and 

|V> ® <M = (aj^ ® I'M 

for every non-negative integer n. Taking adjoints gives 

(if) (g) ap | 6™ = a™) ® (0 ap | 

for all to. Consequently, 

(V> ® 0a P | 6™^," |-0 ® </> ap ) = (i/>\ a ™a A ." |V>) 

Now 

(V <S> M | &aT^a™ 1^ <S> ap ) = J dfixfdfipf z™z* x . n p (nxc ) MPf) 

where p is the distribution of final pointer positions, as defined in Eq. (|8|), and 
is the complex coordinate 

ZAl = 7! (i Mxf + T m ) (13) 

Also 

(ip\ a^al™ \ip) = dpxfdnpf z™z* x . n Q Xi (/z X f , MPf) ( 14 ) 
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where Q Xi is the initial system state Husimi function, as defined in Eq. (Q) . There- 
fore 



dpxfdppf z™z* x . n p (juxf , MPf ) = / dfj.xfdfJ.pf z^zl" Q\, ((Xxf, A*Pf) 
for all n, m. It follows that 

dpxfdp Pi f(z Xi , z* x .) p (p X f, ppt) = J dp X fdp Pi f(z Xi ,z* x .)Q Xi {pxt,ppf) 
for every polynomial /. In particular 

J dp X fdp P f /ixfMw P (MX£ , MPf ) = y dp.xidp.pf MxWf (j"Xf , A*Pf) (15) 
for all m, n. 

At this stage one needs to be careful. It is tempting to suppose, that two 
probability measures which have the same moments must be equal. In fact, this 
inference is not always justified (see Reed and Simon |19| ], vol. 2). However, it is 
justified here, as we show in the Appendix. Consequently 

p (pxf, ppf) = Q\ (pxf, PPf) (16) 

for almost all pxf, ppt whenever the initial system state is in the space 3%. 

It remains for us to show that the distributions are equal in the case of arbitrary 
\ip) € IK sy . We will do this by using a continuity argument. 

Choose a sequence \tp n ) e 1 Xi converging to |^). Let Q\ un be the Husimi 
function, and p n the distribution of measured values corresponding to \ipn}- Let 
Q Xi be the Husimi function, and p the distribution of measured values corresponding 
to \rP). 

We have, as an immediate consequence of the definition, Eq. (|l0|), 

Q\i (pxi,HPt) = lim (Qxuti (j"xf , MPf)) (I 7 ) 

n — ► oo 

for all pxf, ppf. 

On the other hand, it is not generally true that p n converges pointwise to p. It 
does, however, contain a subsequence which converges pointwise almost everywhere. 
In fact, let Li be the Banach space consisting of all integrable functions on K 2 , with 
norm 



Il/lll = / dpxfdppf\f (p X f,ppf)\ 

We have 

\\p - Pn\\i = J dpxfdppf J dxfdyf 1 ...y in (|(a:f,^xf,A*Pf,2/fi> • • • ,Vin I 4> ® 0a P )| 

- \(xf,pxf,PPf,yfl, ■•■,yin I Ipn ® <Aap)| 



< 



||-0 ® 0ap) - \lpn ® 0ap>|| f HlV" ® 0ap> || + 1 1 1 V> ® 0a 



-> 

We see from this that p n — > p in the topology of £1. We may therefore use the 
Riesz-Fisher theorem (Reed and Simon [|l9), vol. 1) to deduce that it contains a 
subsequence p rir such that 

p(pxi, PPf) = lim (/3n r (/iXfiMPf)) 



for almost all /ixf , Mpf- In view of this result, Eq. (|l7|), and the fact that 

Pn r (/*Xf,MPf) = ^Ai,n r (MXfjMPf) 

for all r and almost all /ixf , A*Pf we deduce that 

p(Mxf,MPf) = (Mxf^Pf) 

for almost all /ixf, A*pf ■ 

3. Predictively Optimal Measurements 

We will say that a simultaneous measurement process of the kind defined in 
rcf. fllTf is predictively optimal if the product of predictive errors is minimised: 

A cf xA cf p=J (18) 
In view of the commutation relation 

[exf,epf]=«fr (19) 

there is no need to impose the condition, that the measurement be predictively 
unbiased as a separate requirement: it is a consequence of the condition of Eq. (|l8|). 
Eqs. (Q and @ together imply 

(lp ® </>ap| £ X f |"0 ® <£ap) ® </>ap| Ep f ® 0ap) = "J" 

for every normalised € JC sy . By an argument which parallels the proof of 
Lemma |l| we infer that there exists a fixed number Af such that 

A 2 

(tp O ^ap| C X f \*l> ® 0ap) = y 

ft 2 

® 0ap| 4f F ® 0ap) = ^2 

for every normalised £ 5C sy . It is then straightforward to show that 

4 f |V ® ^ap> - (20) 

for all |?/>) S 9C S y, where d^f is the annihilation operator 

, If 1 . , iA f . 

Since e X f , epf are canonically conjugate there exist kets |exf , Mxf , A*Pf > Vfti ■ ■ ■ , 2/fn) e 
which are simultaneous eigenvectors of the operators e X f , Axf , Apf) Vtji ancl which 
have the property 

d 

e (exf,A*xf,MPf>2/fi: • ■ ■ ,Vin\ hi I*) = -*^q e ( e Xf i Mxf , MPf , 2/fi, ■ ■ ■ ,2/fn I *) 

oexf 

(21) 

for all | ty) £ J£ sy (g) 3-C a p . In view of Eq. ( p(i| ) we then have 
1 3 \ 

^e X f + A f ^— J e {exi, Vxs> fJ>Pf,Vn, ■ ■ ■ ,Vfn\ V> ® 0a P ) =0 







for all \ip) S 3~C sy . Solving this equation we find 

£ (exf , Mxf , MPf> Vn> ■ ■ • > J/fn I </>ap) 





1 F 2 


(^f ) 1 exp 


2Af xf . 



$(MXf,AiPf,3/fl,---,Kfn) 



(22) 



where $ is an arbitrary normalised function. 

There also exist kets \x{, /Ltxf , Hvi , J/fi, ■ ■ • , yin) x which are simultaneous eigenvec- 
tors of the operators if, /fixf, /*Pfj Vfj with the property 

a 

x (x { ,nxf,VPi,ya, ■ ■ -,yfn\ Pi |*) = x (a;f,Mxf,j"Pf,2/fi, ■ ■ • ,2/fn I *) (23) 

for all |\&) S !H sy ® IK ap . In view of the defining relation exf = Axf — if we must 
have 



(24) 

where e-'xO^xf.Mpf.j/fi,-,^) j s a phase. In view of Eqs. (|l|) and (||) we must 
then have 

x (x { ,Vxi,VPf,yn, ■ ■ ■ ,Vin\ Pi W 
9 



(e^ f '™^«'---' w ")>xf - xf , Mxf , MPf , Wi, • • • , «h I *>) 



and 



c (a!f,/zxf,MPf,2/fi> ■ ■ ■ jj/fnl Pf I*) 



e 

p ix(xf ,Mxf ,MPf ,1lti,—,Vtn) 



(Mxf - a?f, A*xf, A*pf, yn, • • ■ ,Vin\ MPf-epf I*) 



x ( MPf + ) , < \i . /'xr- /'i-i • Ha <l\„ I *}| 



exf=Mxf — Xf 



for all |*) e M sy ® 5{ ap . Hence 



which implies 



fi-^X ( X U Mxf, A*Pf > • • • , 2/fn) = MP£ 



1 



X (a* , A*xf , A*Pf , Mfi) • • • j Vtn) = TMPf^f + Xo (Mxf , A*Pf , 2/fi, • • • , Vtn) 

where xo is an arbitrary function. Using this result and Eq. (24) in Eq. (|22| ) we 
deduce, that the final state wave function can be written 

X (X{, flXU MPf) Vll, ■ ■ ■ , Vfn I 1p® 0ap) 

eX P[ _ 2X^ (AXf - =Cf) + iMPf^f + «Xo(MXf,MPf.yfl,---,yfn)] 

X $(MXf,MPf,yfl I --- ; yf™) 

In terms of the state |(^*xf; A*Pf)^ f ) defined in Eq. (^J) this becomes 
x (a;f,/ixf,MPf ! yfi,---,yfn| V'^'^ap) = (a* |(Mxf , MPf)x t ) (MXf, ftpuyn, ■ ■ ■ , Vin) 



in 



where 

(MXf,MPf,yfi, ■ • -,yfn) 
= exp[ixo (Mxf,Aipf,2/fi, ■ ■ - ,2/fn) + ^PiMxf] $ (Mxf , A*Pf , Va, ■ ■ -,2/fn) 
The distribution of measured values p (/^xf , MPf) can be written in terms of $': 

P (MXf : MPf ) = / d Z/fl ■ ■ -Z/fn l $ ' (A*Xf,j"Pf,2/fli ■ ■ ■ :2/fn)| 2 



Suppose, now, that the pointer positions are found to be in the region 31 C R 2 . Let 
/5 sy be the reduced density matrix describing the state of the system immediately 
afterwards. Then 

(xfi\ p sy \xfz) = — [ dfixfdfipfdyn . . . dy in |$' (/x X f, Ppi, Vii, yin)f 

x (xfl |(Aixf,m)A f )((A i Xf,MPf)A t 
where p;x is the probability of finding (/xxf,MPf) G 



Hence 



is 



Psy = — I dflxfdppf p{p X f,PPf) \(pxf,PPf)x t )((pxf,PPf) Xt 



On the other hand 



= / dfj.xfdp.pf P\ s (/x X f,MPf) |(Mxf,MPf)A t )((A i Xf,MPf) 



A f /\VPXi,pFi;A f | 

where Pa ( is the anti-Husimi function (or P-function) |f4|, [l^] describing the final 
state of the system. Comparing these expressions we see 

D / \ / ^-P (MXf , MPf ) if(p X f,/ipf) G # 

I otherwise 

If 3? is a sufficiently small region centred on the point (/xxfj/^pf) the system is 
approximately in the state \(/j,xf , PPf) \ { ) after the measurement: 

Psy « |(/XXf,MPf)A f )((MXf,MPf) Af | 

Eq. shows that the effect of a predictively optimal measurement process 
is to leave the system in a state for which P\ t is a probability density function. 
Such states are, of course, exceptional. In many cases, P\ t is not even defined as a 
tempered distribution Jl4[ . 

4. The Interpretation of the Husimi Function 

The result proved in Section ^ shows that there is a certain analogy between the 
Husimi function and the x-space probability density function |(a; | ip)\ 2 - To see this 
let us examine just what is meant by the statement, that ip)\ 2 5x represents 
the probability of finding the position to lie in the interval (x, x + dx). 

Consider a measurement of x only. For the sake of simplicity suppose that 
the measuring apparatus has only one degree of freedom, corresponding to the 
single pointer observable fix (the argument which follows does not depend on this 
assumption, however). Let \tp) and |0 a p) be the initial states of the system and 
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apparatus respectively, and let U be the unitary evolution operator describing the 
measurement interaction. Let x\ = x and fixf = U' PxiU be the Heisenberg picture 
operators describing the initial position of the system and final position of the 
pointer respectively. Let exi = Axf — %i be the retrodictive error operator. 
The final state wave function can be written (in the Schrodinger picture) 

(x,fi x \ U\^<^4> ap ) = Jdx'K(x,^;x') (x'\ tp) 

for some kernel K. The probability distribution describing the result of the mea- 
surement then takes the form 

2 



P (mx) = \ dx 



dx' K (x, fix', x') (x' | ip) 



(26) 



After a certain amount of algebra one also finds 



(ip ® ap | e-xi \ip® 4> ap ) = / dxdfi 



dx' (fix - x') K (x, fix; x') (x | ip) 



(27) 



Suppose that A c ia; = 0. Then we see from Eq. ( |27| ) that K must take the form 

K (x, fix; x') = / (x, fix) 5 (fix - x') 
for some function /. The unitarity of U means that / must satisfy 



dx \f (x^xW 



1 



Using these results in Eq. (E6J) we find 



p(a*x) = |(Mx I ^)\ 2 

whenever the measurement is perfectly accurate for the purposes of retrodiction. 

Suppose, on the other hand, that A e ja; > 0. Then p (/ix) will not generally co- 
incide with the function |(/ix I 4')^ ■ If ^ciX is small compared with the de Broglie 
wavelength, then we see from Eqs. © and (||) that p(fix) « \{fix\ 4>)f ' ■ Other- 
wise, we do not expect the two functions even to be approximately equal. 

Although one may possibly approach, one does not expect actually to achieve 
the limit of perfect accuracy. It follows, that one does not expect the function 
|(a*x| V>)| 2 to describe the outcome of any practically realisable measurement of 
position. 

This being so what, exactly, is the significance of the function |(/ix I V 1 )! 2 ? hi 
the first place, it serves as a standard of comparison, against which the outcome of 
experimentally realisable measurements can be judged: in the sense, that the better 
the measurement, the more closely does the function \(fj,x | ip}\ 2 approximate the 
distribution of actual results. 

In the second place, we see from Eq. (^6|) that the outcome of a real measurement 
of position depends, not only on the state of the system, via the function (x' \ ip), 
but also on the details of the measurement process, via the function K (x, fix] x'). 
In the limit of perfect retrodictive accuracy, however, the dependence on the appa- 
ratus (as represented by the kernel K) disappears, and the distribution of results 
is determined solely by the state of the system (as represented by the vector \ip))- 
\(fj,x | il>)\ 2 does, so to speak, represent the intrinsic distribution of position, inde- 
pendent of any properties specific to the particular measuring instrument employed. 
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In a real measurement, by contrast, the outcome is (in a manner of speaking) con- 
taminated by instrumental contributions, which one may try to reduce, but can 
never entirely eliminate. 

One typically regards the function \(fix I V>)| 2 simply, and without qualification, 
as the x-space probability distribution. It owes this canonical status to the two 
features just mentioned. The result proved in Section || shows that the Husimi 
function has analogous features. It describes the outcome of those measurements 
which are retrodictively optimal, or "best" . It is otherwise independent of the 
details of the particular process considered. It might therefore be regarded as the 
canonical probability distribution for position and momentum. 

In classical mechanics one has the concept of the "actual" distribution describ- 
ing an ensemble of identically prepared systems. Quantum mechanics contains no 
precise analogue for this concept (unless one adopts a "hidden-variables" inter- 
pretation pet ). Nevertheless, the result proved in Section || shows that there are 
certain resemblances between the Husimi function and the classical distribution. 
The Husimi function is clearly not the same as the classical distribution. However, 
one might reasonably argue that it is the closest that quantum mechanics allows us 
to get to the concept of a "real" or "objective" phase space probability distribution. 



Appendix. Proof of Equation (|16|) 



Rather than working in terms of the functions p, Q\ { it will be convenient, 
instead, to work in terms of the measures 

dp p = p (/ixf, MPf) dp X {dp P { 

dpq = Q\ (/ixf , MPf) duxid/ipf 

We have from Eqs. (0) and ([j]| 

dfJ-p \z\f n = / dfiQ \z Xi \ 2n = (ip\ a" a{™ \ip) (28) 



where z Xi is the complex co-ordinate defined in Eq. (|l3|). Our strategy will be, first 
to establish a bound on the rate at which these quantities grow with increasing 
ra, and then to use this to show that the measures p p , pq have the same Fourier 
transform. 

IV') is in the subspace It can therefore be written 

; 



r=0 



for some integer /. Hence 



At " l„/,\ _ (" + r ) ! i„ |2 ^ (n + l)\ 



(V'l O^OaT \i>) = y ' L ^' h \ C A 2 < 

r=0 

Let /i stand for either of the measures fi p , [iq. In view of the inequality just proved, 
Eq. (Hj) and the fact 

I |2n+l . 1 /, ,2n . | ,2n+2 

< - \z\,\ + IzxA 



2 

we have 

r{±n + i + l) 



dp \z Xi \ n < 



r(i + i) 
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for every non-negative integer n. Hence 

°° 1 f 



n=0 



for all j3, 7 G C. It follows that the functions e '' 5zA i +7ZA i' and e^ Zi i +7ZA i are /i- 
integrable. We may therefore use Lebesgue's dominated convergence theorem (Reed 
and Simon |l{J, vol. 1) to infer 



/ d\x p exp [fiz x . + jzl x ] = Hm ( E ~~i / rf ^> (/^ 



\n=0 J 

J dfiQ exp [(3z x . + 7^.] 



for all /3, 7 € C. Consequently 

y d/Xp exp [i (fcx/ixf + fcpMPf )] = y ex P [* (kx^xf + fcptfpf )] 
for all fex, fcp S R. Inverting the Fourier transforms we deduce 
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